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Abstract 

We derive the exact distribution of the largest eigenvalue for finite dimensions real 
Wishart matrices and for the Gaussian Orthogonal Ensemble (GOE). We compare the 
exact distribution with the Tracy-Widom distribution which arises in many fields as the 
limiting distribution of the largest eigenvalue of large random matrices. In this regard 
we show that the Tracy-Widom distribution can be approximated by a properly scaled 
and shifted Gamma distribution, with great accuracy for the values of common interest 
in statistical applications. 

Keywords: Random Matrix Theory, characteristic roots, largest eigenvalue, 
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1. Introduction 

The distribution of the largest eigenvalue of Wishart and Gaussian random matrices 
plays an important role in many fields of multivariate analysis, including principal com- 
ponent analysis, analysis of large data sets, communication theory and mathematical 
physics HIl. 

The exact distribution of the largest eigenvalue for finite dimension uncorrelated 
central complex Wishart matrices is given in Q]. The extension to non-central uncor- 
related complex Wishart is derived in [4], while the case of correlated central complex 
Wishart matrices with arbitrary one-sided correlation can be obtained by following the 
approach in H, |f3, Q] . 

In this paper we derive a simple expression for the exact distribution of the largest 
eigenvalue for real Wishart matrices and for the GOE, which can be used for arbitrary 
dimensions. 

Also, we compare the exact distribution with the Tracy-Widom distribution, which 
arises in many fields as the limiting distribution of the largest eigenvalue of large ran- 
dom matrices, and whose applications include principal component analysis, anal ysis 
of large data sets, communication theory and mathematical physics [8, % Tfl 11, 12, 



13H . In this regard we show that the Tracy-Widom distribution can be approximated by 
a properly scaled and shifted Gamma distribution, with great accuracy for the values of 
common interest in statistical applications. 
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2. Exact distribution of the eigenvalues for finite dimensions Wishart and Gaus- 
sian symmetric matrices 



We derive the exact distribution of the largest eigenvalue for real Wishart matrices 
and for random symmetric Gaussian matrices. For completeness we also summarize 
the analogous distributions for the complex case (Wishart and Gaussian Unitary En- 
semble). 

2.1. Real random matrices: uncorrelated Wishart and the Gaussian Orthogonal En- 
semble (GOE) 

Assume a Gaussian real p x m matrix X with independent, identically distributed 
(i.i.d.) columns, each with zero mean and covariance S = I. Denoting n m i n = min{m, p}, 
"■max = max{m,p}, the distribution of the (real) ordered ei genv alues Ai > A2 . . . > 
An mi „ > of the real Wishart matrix W = XX T is given by H14lllll 



f X ( Xl , £„„„) = K e -*U2 x {^-n mm -l)/2 -Q ^ _ xj) (1) 

i=l 

where K is a normalizing constant given by 



i<3 



K ~ 2«.™«.™/ 2 r„ mi „(n max /2)r„ min (n min /2) (2) 

with r m (a) = 7r m (— 1 )/ 4 n™ 1 r(a - (* - l)/2) . 

Denotingx = [x\,X2, ■ ■ ■ , x„ mjn ], the probability distribution function (p. d.f.) in ([U 
can be written alternatively, in terms of the Vandermonde matrix V(x) with elements 

f x (x)=K\V( X )\l[e-*>/ 2 x? (3) 

»=i 

where a = (ri max — rt min — l)/2 and | • | stands for determinant. 

Similarly, for the Gaussian Orthogonal Ensemble the interest is in the distribution 
of the (real) eigenvalues for real nxn symmetric matrices whose entries are i.i.d. Gaus- 



sian Af(0, 1/2) on the upper-triangle, and i.i.d. 7V(0, 1) on the diagonal [13]. Their 



joint p.d.f. is / A (xi, . . . , x„ min ) oc n"=i e Xi/2 ' ]!"< , ( x i ~ x o) Hill- Then, by 



following the same approach as for Wishart, the joint p.d.f. of the ordered eigenvalues 
for these matrices can be written 

n 

/ A (x)=^ GOB |V(x)in e -^/ 2 (4) 

1=1 

where Kqoe = (2 n / 2 IJiLi T[*/2]) _1 is a normalizing constant. Note that the eigen- 
values here are distributed over all the reals. 
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Theorem 1. The cumulative distribution function (CDF) of the largest eigenvalue of 
the real Wishart matrix W is 



F Xx ( Xl ) = Pr{Ai < Xl } = Ky/\A( Xl )\ (5) 
where for even n m i„ the elements of the n m ;„ x n m ,„ skew-symmetric matrix A(xi) are 

Oijixi) = 2 2 "+^ { 7 (a + i, ^) 7 (a + j, ^) + 

r*i/2 "I 

-2y ^-^-^(a+i,*)*^ (6) 

fori,j = 1, . . . ,n m j n . Note that aij(x\) = —aj^ixt) and a^j(xi) = 0. 

When n m i„ is odd, the elements of the (n m j„ + 1) x (n mi „ + 1) skew-symmetric matrix 
A(xi) are as in (O, w/f/z f/ze additional elements 

ai, nmln +i{xx) = 2 Q+J 7 (w + h-W-J *=!,••-, "mm 

a »wH,i( :c l) = "^'.nmm+l^l) j = 1, ■ • ■ ,n-m/n 

a»i,„„,+i : n,„„,+i(a;i) = 
where 7(0, x) = t a ~ 1 e~ t dt is the lower incomplete gamma function. 
Proof 1. We start by writing 



F Xl {x x ) = Pr{Ai <xt}= I ■■■ I f x (wi, . . .,w nmta )dw (7) 

with T>(xi) — {wi, . . . , w nmin : xi > wi > W2 ■ ■ ■ > w nmln > 0}. 
Then we have 

[■[ fx(wi, . . . ,w nnln )dw = kJ ' ... J |V(w)| j]f(tfli) dw (8) 

w/iere £(x) = e~ x ' 2 x a . To evaluate this integral we recall that for a gen eric to x to 
matrix <&(w) with elements {&i(wj)} the following identity holds fl(k 17 1 



J - J mw)\dw=pf(A) 



(9) 

where Pf(A) = yj\ A| is f/ie Pfafftan, and the skew-symmetric matrix A is m x mfor 
m even, and (to + 1) x (to + l)for m odd, with 



'i.j 



\ \ sgn(y - x)$i(x)$j(y)dxdy i,j = 1, . . . , m. (10) 

J a J a 



For to odd the additional elements are di, m +i = — a m +l,i = / $j(x)dx, i = 
1, . . . , m, one/ a m+ i,m+i = 0. 

Using (0 /n ([8]) w/f/i a = 0,& = Xi,$i(x) = x l_1 e _:z: / 2 x a w;Y/i some simple 
manipulations gives Theorem\l\ 
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Theorem 2. The CDF of the largest eigenvalue for the Gaussian Orthogonal Ensemble 
( GOE) matrices is 

F Xl ( Xl ) = Pr{Ai < xi} = K GO eV\Mxi)\ (11) 
where for even n the elements of the n x n skew-symmetric matrix A.{x\) are 

e-*e- e IW,x)dt (12) 

-oo 

and 

W,x) = 2*- 1 (sgnix)^ (-l)T (0) 

fori, j = 1, . . . ,n. 

When n is odd, the elements of the (n+ 1) x (n + 1) skew-symmetric matrix A.(x\ ) 
are as in (1121 . w/f/i f/;e additional elements 

ai,„+i(xi) = ip(i,xi) i = l,...,n 
a n +i,j(xi) = -a^n+x{xi) j = l,...,n 

a n +i,n+i(xi) = 0. 

Proof 2. Starting from (|4|l f/ie proof is similar to that for Theorem\l\ 

2.2. Complex random matrices: uncorrelated Wishart and the Gaussian Unitary En- 
semble (GUE) 

Assume now a Gaussian complex p x m matrix X with i.i.d. columns, each with 
zero mean and co variance X. The distribution of the (real) ordered eigenvalues of the 
complex Wishart matrix W = XX ff is known since many years from 1 1411 in terms of 
hypergeometric functions of matrix arguments. Unfortunately, the expressions given in 
lfl4ll are not easy to use, due to the difficulties in evaluating zonal polynomials. The first 
expression of practical usage for the joint distribution of the eigenvalues of a complex 
Wishart matrix with correlation has been given in 15[] by expressing the hypergeometric 
function of matrix arguments as product of determinants of matrices. More recently, 
that approach has been expanded to cover the case where X has eigenvalues of arbitrary 
multiplicity, and to find several statistics regarding the marginal eigenvalues distribu- 
tion Ja, S IHtl ■ By using these approaches, the exact statistics of an arbitrary subset of 
the ordered eigenvalues can be evaluated easily for finite dimension complex quadratic 
forms and Wishart (uncorrelated and correlated) matrices. 

Regarding the largest eigenvalue statistic, below we report a known result for the 
particular case of uncorrelated complex Wishart matrices (i.e., for S = I). 

Theorem 3. The CDF of the largest eigenvalue of the uncorrelated complex Wishart 
matrix W is i3/ 

F Xl (x 1 ) = -Pr{\ 1 <x 1 }=K c \A(x 1 )\ (13) 
where the elements of the n m j n x n mi „ matrix A(xi) are 

Oijfa) = / t n '"°>- n """ +l+j - 2 e- t dt = 7 (rw - n mi „ Xl ) (14) 

Jo 
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and Kc is a normalizing constant given by 



K c = -= = (15) 



withf m (n)=7T m ^ i y 2 Uti( 



n — i 



The following is a similar result for complex hermitian random matrices with i.i.d. 
CAf(0, 1/2) entries on the upper-triangle, and Af(0, 1/2) on the diagonal. These ma- 
trices constitute the so called Gaussian Unitary Ensemble (GUE) 111 311 . 

Theorem 4. The CDF of the largest eigenvalue for the GUE is 

F Xl ( Xl ) = Pr {Ax < Xl } = K GUE \A( Xl )\ (16) 
where the elements of the n x n matrix A(xi) are 



Oij(si)= r t i+j - 2 e- t2 dt=l 



— OO 



7 (^ ± i-^,x?)^i) i+j '- i +(-i) i+j ' 



(17) 

and Kque = 2"(™~ 1 )/ 2 (7r"/ 2 Jl"=i 1 is a normalizing constant. 

Proof 3. For the GUE the joint distribution of the ordered eigenvalues can be written 

n 

/ A (x) =K GUE \V(x)\ 2 l[e- x > (18) 



as 4731/ 



|2 ' 

i=l 



Then, by using /@ 77z. 7] with a = —oo,b = xi,^i(xj) = $j(x_y) = x*- 1 ,£(a;) = 

_ 2 

e x we gef immediately the result. 

These thorems can be used for finite dimensional random matrices of limited di- 
mensions in the uncorrected case. For the extension of Theorem [3] to correlated com- 
plex Wishart see II 181 1611711 and references therein. 

We remark that the previous theorems can be used to obtain explicit expressions for 
the distribution of the largest eigenvalue. In fact, 7(0, x) can be written as combina- 
tions of exponentials and powers of x when a is an integer, and as the combination of 
exponentials, powers of x, and erf (a;) when a is a multiple of 1/2. Thus, we can write 
explicit expressions for the CDF and p.d.f. of Ai for all previous theorems. 

For example, by expanding © we derived the following expressions for the CDF 
of the largest eigenvalue for real Wishart matrices. 



(19) 
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Foi Timin — T^max 3. 



F Xl ( Xl ) = e -3x/2( e ,/ 2(e x_ x _ 1)erf /|_7^ (e , (x _ 1) + 1) 



7txc x (e x (x 2 - 4x + 6) - 2(i + 3)) erf 



(20) 



(a/2 (4e 2K - e x (a; 3 + 2a; 2 + 2x + 8) + 2 (a; + 2)) + 



(21) 



Similar expressions can be derived for the p.d.f, for complex Wishart, for GOE 
and for GUE. These expressions becomes cumbersome for large matrices. 



3. Limiting behavior for large random matrices: the Tracy- Widom distribution 



The pioneering works Q @] and iflHl have shown the importance of the Tracy- 
Widom distribution, which arises in many fields as the limiting distribution of the 
largest eigenvalue of large random matrices. This distribution, originally derived in 
the study of the Gaussian unitary ensemble, has been shown to be related to many ar- 
eas concerned with large random matrices. Applications include principal component 
analysis, analysis of large data sets, combinatorics, communication theory, representa- 
tion theory, probability, statistics and mathematical physics 

For example, it has been shown in |[Tll[l2Tl for principal component analysis (PCA) 
that if X is an n x p matrix whose entries are i.i.d. standard Gaussian and Ai is the 
largest eigenvalue of XX H , then for n, p — >• 00 



TW/3 



(22) 



where TW p denotes a random variable (r.v.) with Tracy-Widom distribution of order 
(3, for j3 — 1,2,4. In the previous expression /3 = 1 when the entries of X are standard 
real Gaussian, and (3 — 2 when the entries are standard complex Gaussian. We recall 
that a random variable Z is said to have a standard complex Gaussian distribution 
(denoted CAf (0,1)) if Z = (Z\ + 1Z2), where Z\ and Zi are i.i.d. real Gaussian 
A/"(0, 1/2). The scaling and centering parameters in (|22] > are III, 12 1 



= (V?i + ai + \Jp + a 2 y 
1 



1 



y/n + ai ^Jp + a 2 



1/3 



(23) 
(24) 



where the best adjustment parameters ai, a 2 are known to be ai = a 2 = —1/2 for 
real Wishart (J3 = 1) and a% = a 2 = for complex Wishart (f3 — 2). A similar 
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behavior can be proved for more general conditions when the entries of X are not 
Gaussian |T^, 21]. Due to the simplicity of this result, the Tracy- Widom distribution is 
of extreme usefulness for problems involving PCA with large dimensional matrices. 
The Tracy- Widom CDFs are given by HUB 



Fi (x) = exp 

F 2 (x) = cxp 



q{y) + (y- x)q 2 {y)dy 
(y - x)q 2 (y)dy 



cosh < - 



q(y)dyj ^F^xj 
where q(y) is the unique solution to the Painvale II differential equation 

q"(y) = yq'(y) + 2<z 3 (y) 

satisfying the condition 

q(y) ~ Ai(y) y -> oo 

and Ai(y) denotes the Airy function. 

The function F^x) can be derived from the other two. In fact, from 
d27l l we can write 



and 



1 

71 



Fi(x) 



fi(xV2) 



F l{ xV2) + ^- 
F 1 (xV2) 



f 2 (xV2)F 1 (xV2) - F 2 (xV2)f 1 (xV2) 
F?(xV2) 



(25) 

(26) 
(27) 

(28) 

(29) 
and 

(30) 
(31) 



where fp(x) = dFp(x)/dx is the p.d.f.. So in the following we will mainly focus on 
Fi(x) and i^) • 

These distributions can be evaluated numerically by solving the Painleve II differ- 
ential equation d28l i or the corresponding Fredholm determinant 

mm Ems m. 

In this paper we propose a very simple approximation for the Tracy-Widom dis- 
tribution, to avoid the need for numerical solution of differential equations of Fred- 
holm determinants. The approximation is shown to be extremely accurate for val- 
ues of the CDF or of the complementary complementary cumulative distribution func- 
tion (CCDF) of practical uses. 



4. A simple approximation of the Tracy-Widom distribution based on the gamma 
distribution 

It is known that the exact distribution of the largest eigenvalue of a complex Wishart 
matrix is a mixture of gamma distributions, i.e., its p.d.f. can be expressed as the 
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Table 1: Parameters for Approximating TVtfp with r[fc, 0] — a. 





TWi 


rw 2 




k 


46.446 


79.6595 


146.021 


e 


0.186054 


0.101037 


0.0595445 


a 


9.84801 


9.81961 


11.0016 



weighted sum of terms x a e~ bx (see ^ for the uncorrelated case and I^ ITsll for one- 
sided correlated Wishart matrices). For finite dimensions matrices, it can be shown that 
the exact distribution is very well approximated by a (single) gamma distribution, with 
proper parameters chosen to match the first moments of the true distribution. 
Based on these observations, we propose the approximation 

TWp ~G-a (32) 

where a is a constant, and Q ~ T(fc, 6) denotes a Gamma r.v. with shape parameter k 
and scale parameter 6. Thus the CDF and p.d.f. of TW ' p are approximated as: 

Fp{x) ~ flfc] 7 ( fc '^) ' X> ~ a (33) 

fp( x ) - prnp ( x + a )^ 1 » x>-a (34) 

where T[.] is the Gamma function, and 7 (k, x) — f Q t k ~ 1 e~ t dt is the lower incom- 
plete Gamma function. 

The parameters k, 9, a should be suitably chosen according to some criterion. For 
example, we have chosen to set k, 6, a for matching the first three moments of the 
distributions TWp. To this aim we recall that for the Gamma r.v. the meas is E {Q} = 
k8, the variance is var {Q} = k9 2 and the skewness is Skew {Q} = -j=. If /j,p,ap, Sp 
are the mean, variance and skewness of the Tracy-Widom (see e.g. 11211 ). matching the 
first three moments gives: 

k = ^ < 35 > 

8 = ap^- (36) 
a = k8 — [ip (37) 

The parameters for the approximation (l33l (l34l obtained from these equations are re- 
ported in Table Q] 

The comparison with pre-calculated p.d.f. values from [22] is shown in Fig. [T]Since 
in linear scale the exact and approximated distributions are practically indistinguish- 
able, in Fig. |2]we report the CDF and CCDF in logarithmic scale for Tracy-Widom 2 
(similar for the others). It can be seen that the approximation is in general very good 
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for all values of the CDF of practical interest. In particular there is an excellent agree- 
ment between the exact and approximate distributions for the right tail. The left tail 
is less accurate but still of small relative error for values of the CDF of practical sta- 
tistical uses. Note that, differently from the true distribution which goes to zero only 
asymptotically, the left tail is exactly zero for x < — a. 

pdf(x) 





0.5 








TW2 / 


\ 0.4 

/v\ - 3 














TWl 






0. \ 








-5 
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Figure 1 : Comparison between the exact (continuous line) and approximated (dashed) PDF for the Tracy- 
Widom 1 and Tracy-Widom 2. The exact and approximated curves are practically indistinguishable on this 
scale. 



CDF(x), CCDF(x) 
1 . 



0.01 - 

io- 4 - 

to- 6 - 

io- s - 



-5 5 

Figure 2: Comparison between the exact (continuous line) and approximated (dashed) CDF, CCDF, Tracy- 
Widom TW'2, log scale. The two CCDF are practically indistinguishable. 

Some specific values are given in Table [2] and [3] where it can be noted that, for 
values of common use, the relative error is small. 
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Table 2: Precision of the approximation: CDF of TWi vs. T[k, 9] — a for some percentiles. 



X 


Target 


CDF 


CDF 


CDF 




CDF 


[22J 


approximation 


rel. error (%) 


-4.64 


0.001 


0.0011 


0.0009 


-17.40 


-3.90 


0.010 


0.0099 


0.0095 


-4.02 


-3.18 


0.050 


0.0500 


0.0501 


0.16 


-2.78 


0.100 


0.1004 


0.1010 


0.65 


-1.91 


0.300 


0.3001 


0.3011 


0.32 


-1.27 


0.500 


0.4995 


0.4995 


-0.01 


-0.59 


0.700 


0.7006 


0.6998 


-0.12 


0.45 


0.900 


0.9000 


0.8996 


-0.04 


0.98 


0.950 


0.9500 


0.9500 


-0.00 


2.02 


0.990 


0.9899 


0.9901 


0.02 


3.24 


0.999 


0.9989 


0.9990 


0.01 



5. Numerical results 

The calculation of the exact distribution of the largest eigenvalue is easy by using 
Theorems 1-4 for not too large random matrices. For example, we show in Fig. [3]and 
Fig. |4]the distribution of the largest eigenvalue for real and complex Wishart matrices, 
with different n m ; n , n max . In the figure we report the exact distributions given by {3), 
(fT3l > and the centered and scaled Tracy- Widom distribution ( 122b (here we can use the 
exact Tracy-Widom or the approximations fl33l which are non distinguishable in this 
scale). 

In Fig. [SJand Fig. |6]we report the exact distribution for GOE (eq. ( fTTT i) and for GUE 
(eq. (TTBTl). for n — 2, 5, 10, 15, 20, 25. In the same figures we report the approximation 



based on the Tracy-Widom distribution, which in these cases is [8, 9l 1 1 311 : 

^Anv, (38) 

with fi' n = 2<ToyJn — a\ and <j' n = <7rj(n — a^) -1 ^ 6 , where a\ — 1/2 is the variance 
of the off-diagonal elements in the ensembles in our normalization. In the previous 
expression we must use ft = 1 and ft = 2 for the GOE and GUE, respectively. While 



1 1311 indicates a\ = 0,2 — 0, we have observed that the approximations are better 
for small n if we choose the adjusting parameters a\ = = 1/2 for the GOE and 
oi = 0,a 2 = 1 for the GUE. 

We note that, for large dimension problems, the asymptotic distributions predicted 
by the Tracy-Widom laws converge soon to the exact. In particular, for GOE and GUE 
the properly scaled and centered Tracy-Widom laws are already very close to the exact 
for very small matrices (n = 2). Also, we remark that the simple approximations (|33l , 
d34l l can be used instead of the pre-calculated tables for the Tracy-Widom distribution 
for values of practical interest in statistic. 
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Table 3: Precision of the approximation: CDF of TVV2 vs. T[k, 9] — a for some percentiles. 



X 


Target 


CDF 


CDF 


CDF 




CDF 


[22J 


approximation 


rel. error (%) 


-4.29 


0.001 


0.0010 


0.0009 


-8.89 


-3.72 


0.010 


0.0102 


0.0100 


-1.77 


-3.19 


0.050 


0.0505 


0.0506 


0.16 


-2.90 


0.100 


0.1003 


0.1006 


0.35 


-2.26 


0.300 


0.3025 


0.3029 


0.14 


-1.80 


0.500 


0.5022 


0.5021 


-0.02 


-1.32 


0.700 


0.7018 


0.7014 


-0.06 


-0.59 


0.900 


0.9012 


0.9011 


-0.02 


-0.23 


0.950 


0.9503 


0.9503 


0.00 


0.48 


0.990 


0.9901 


0.9901 


0.01 


1.31 


0.999 


0.9990 


0.9990 


0.00 
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Figure 3: CDF of the largest eigenvalue, real Wishart matrix, = 5, n m ax = 5, 10, 15, 20. Comparison 
between the exact distribution (5) (continuous line) and the scaled and centered TWi as in (22) (dotted). 




Fi gure 4; CDF of the largest eigenvalue, complex Wishart matrix, Ti m \ n — 5, fimax — 5,10, 15,20,25,30. 
Comparison between the exact distribution )13t (continuous line) and the scaled and centered 7~W2 as in 
(22) (dotted). 
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